Today, I am talking about how Netflix uses AI, Data Science, and Machine Learning.
The presence of AI in today’s society is becoming more and more ubiquitous particularly as large companies like Netflix, Amazon, Facebook, Spotify, and many more continually deploy AI-related solutions that directly interact with consumers every day.
When properly applied to business problems, these AI-related solutions can provide really unique solutions that scale and improve over time, creating significant impact for both business and user. But what does it mean to “properly apply” an AI solution? Does that mean there is a wrong way? From a product perspective, the short answer is yes, and we’ll get to why that is later in this article as we dig deeper.
Overview: First, we will outline 5 use cases of
or machine learning at Netflix. We’ll then discuss some business needs vs technical considerations a Product Manager would look at. Then we will dive a little deeper into what is perhaps the most interesting of these 5 use cases as we identify what business problem it seeks to solve.
Five Use Cases of AI/Data/Machine Learning at Netflix :
- Personalization of Movie Recommendations: Users who watch A are likely to watch B. This is perhaps the most well-known feature of Netflix. Netflix uses the watching history of other users with similar tastes to recommend what you may be most interested in watching next so that you stay engaged and continue your monthly subscription for more.
- Auto-Generation and Personalization of Thumbnails: Using thousands of video frames from an existing movie or show as a starting point for thumbnail generation, Netflix annotates these images then ranks each image in an effort to identify which thumbnails have the highest likelihood of resulting in your click. These calculations are based on what others who are similar to you have clicked on. One finding could be that users who like certain actors/movie genres are more likely to click thumbnails with certain actors/image attributes.
- Location Scouting for Movie Production: Using data to help decide on where and when best to shoot a movie set given constraints of schedule, budget, and production scene requirements. Notice this is more of a data science optimization problem rather than a machine learning model that makes predictions based on past data.
- Movie Editing: Using historical data of when quality control checks have failed in the past to predict when a manual check is most beneficial in what could otherwise be a very time-intensive and laborious process.
- Streaming Quality: Using past viewing data to predict bandwidth usage to help Netflix decide when to cache regional servers for faster load times during peak demand.
These 5 use cases of data science or machine learning just in Netflix alone have had such a scalable impact that they have forever changed the technology landscape and user experience for millions and more to come. Adoption of these AI-related solutions is only going to get stronger over time.
But before these use cases were as commonplace as they are today and used by users like you and me, someone, or some group within Netflix properly connected these AI solutions with a business need. Without this business link, these use cases would simply be pie in the sky ideas sitting at the bottom of a backlog like so many other great ideas. Only through proper positioning and connection with Netflix’s core business problem did these ideas become the reality that they are today.
What is the Business Problem?
Notice in each of the use cases I’ve identified above, each one is associated with a specific business need, goal, or hypothesis.
This is absolutely important for any product manager to avoid the temptation of the tech enthusiast who marvels in the details of the data science or ML for intellectual reasons without clearly identifying the problem or business need potentially using up valuable technical resources with no business impact.
At the end of the day, product managers need to properly connect a business problem to a data machine learning solution. We want to avoid having a solution that’s chasing for a problem, otherwise, the project will lose momentum within the company: engineers won’t be clear what their North star is, stakeholders across the organization won’t buy-in and allocate the necessary resources to make the project a success, etc.
Make sure there is a problem to which an AI solution can be directly connected
Machine learning is a potential AI solution but we need to first define the problem before prescribing that solution.
What’s the business result we are trying to achieve with ML? Because this core business need is what drives the parameters of the ML models used, what data is collected and processed, etc. We don’t do ML to provide personalization just because it’s interesting tech we need to link it to a business problem.
Yes, that would be a pretty awesome use case leveraging natural language processing to understand your post-episode commentary in context. In addition to NLP, this use case uses text to voice personalities as well as sentiment analysis of how thousands of others felt about what happened in that episode, or how they feel about a certain character. Indeed, this is a beautiful merging of multiple cutting edge technologies in one use case.
If a pilot MVP version of this showed that users who engaged with his new feature stayed longer or came back more often or helped drive more word of mouth about Netflix, then it could warrant further resources. The initial decision to build that MVP would depend on the strategic decisions made by stakeholders, not necessarily prioritized by metric. That will depend on company strategy.
But as beautiful of a user scenario the above is, what problem does that solve?
How does it relate to Netflix’s main problem of keeping users subscribed every month? If it’s related, what evidence qualitative or quantitative do we have to support that relationship?
And if this is a legitimate solution to that problem, is there a simpler version of this solution that could equally accomplish that problem but be less technically complex? For example, instead of voice input and voice output, how might the complexity of just text input and text output affect the level of effort and impact on user engagement?
What if a conversational AI interface without the voice part achieved 80% of the intended user engagement but only required 40% of the development effort? Would it be worth considering such an alternative route?
What business impact would such a solution have in comparison to the level of effort? How does this ratio compare with that of other competing tasks in the backlog?
These are all product-focused questions that a PM should be asking in order to align technology solutions with business needs. Because ultimately, it’s the business need that drives the parameters of an ML model, not the other way around.
Movie Recommendations: Identifying the Problem
Here the problem is that Netflix has a huge collection of content that is constantly changing and can be overwhelming for a user to consume. Users don’t want to be frustrated in finding content relevant to their interests. So then, what is the best way to allow each user to consume that data in a way that ultimately maximizes subscription loyalty?
Product Goals include:
- Increase or maintain viewership in terms of minutes consumed,
- Increase in of titles explored, frequency of logging back in
- Exceeding whichever minimum threshold that the company determines is a success metric
- The overall increase in monthly subscription loyalty or decrease in subscriber cancellations.
Thumbnails Are Important. But What Exactly Do We Tweak?
- Increase click-thru-rates of movie recommendations signifying engagement
- The hypothesis that higher engagement rates will lead to higher subscriber satisfaction and loyalty
Product Considerations In Personalized Image Thumbnails
- Each movie should ideally have a personalized thumbnail that maximizes clicks. Since Netflix has data on clicking behavior of other people with similar interests, it is a reasonable hypothesis to guess that if other people with similar interests and watch history had a high click-thru rate on a certain thumbnail, then it is likely that this image thumbnail will perform will on a new person who hasn’t yet been recommended this movie or thumbnail.
- The personalized thumbnail should take into consideration other movies there are being recommended at the same time and what those image recommendations are. Let’s say Netflix is recommending 2 different Spiderman movies to a user side by side and they both have Spiderman facing the camera mask off. One is Tobey Maguire and the other is Andrew Garfield. Wouldn’t it be weird for the user to see both portraits of Maguire and Garfield as Spiderman with their masks off side by side? Something to account for if that ever were to occur. One image thumbnail could work well in isolation, but that may not be good enough when a page of a dozen thumbnails shows up. If they are all optimized to look the same way, then as a group, each one may seem less compelling. So looking at each thumbnail together with what else is being presented will be important.
- Data is great, but watch out for algorithms that do their job too well, resulting in unintended consequences or false positives! In statistics, they call this a Type I error falsely suggesting an image thumbnail that shouldn’t be suggested.
- There are 2 parts to this: What data does Netflix use to create these personalized thumbnails or artwork?
- What data does Netflix use to target these custom-created thumbnails to the appropriate individual?
- A 1-hour episode of Stranger Things has >86,000 static video frames
- These video frames can each individually be assigned certain attributes that are later used to filter down to the best thumbnail candidates through a set of tools and algorithms called Aesthetic Visual Analysis. This is designed to find the best custom thumbnail image out of every static frame of the video
- Netflix Annotation: Netflix creates metadata for each frame including brightness (.67), * of faces (3), skin tones (.2), probability of nudity (.03), level of motion blur (4), symmetry (.4)
- Netflix Image Ranking: Netflix uses the metadata from above to pick out specific images that are the highest quality and most clickable
- * of movies watched, # of minutes of each show watched
- % of completion for every video/series
- * of up-votes, which movies were favorites, etc
- % of overall watch content that is attributable to any specific show any seasonal or weekly trends related to a user’s level of engagement, etc.
How Netflix Uses Data to Construct A Universe of User Profile Interests
Reimagining Netflix Users in Mathematical Relation To Each Other
- “6” = romantic comedy
- “4” = thriller
What Did Netflix Learn From All This Data?
- Show close-ups of emotionally expressive faces
- Show people villains instead of heroes
- Don’t show more than three characters